QSAR study for macromolecular RNA folded secondary structures of mycobacterial promoters with low sequence
نویسندگان
چکیده
a Department of Organic Chemistry, University of Santiago de Compostela, 15782, Spain. b Chemical Bioactives Center and Department of Veterinary Medicine, Central University of ‘Las Villas’, 54830, Cuba. c Department of Ultrasound Medicine, Calixto, Las Tunas, 77400, Cuba. ______________________________________________________________________________ Abstract. The general belief is that quantitative structure-activity relationships (QSAR) techniques work only for small molecules and, proteins sequences or, more recently, DNA sequences. However, with non-branched graph for proteins and DNA sequences the QSAR often have to be based on powerful non-linear techniques such as support vector machines. In our opinion linear QSAR models based in RNA could be useful to assign biological activity when alignment techniques fail due to low sequence homology. The idea bases in the high level of branching for the RNA graph. This work introduces the so called Markov electrostatic potentials ξM as a new class of RNA 2D-structure descriptors. Subsequently, we validate these molecular descriptors solving a QSAR classification problem for mycobacterial promoter sequences (mps), which constitute a very low sequence homology problem. The model developed (mps = –4.664·ξM + 0.991·ξM – 2.432) was intended to predict whether a naturally occurring sequence is an mps or not on the basis of the calculated ξM value for the corresponding RNA secondary structure. The RNAQSAR approach recognises 115/135 mps (85.2%) and 100% of control sequences. Average predictability and robustness were greater than 95%. A previous non-linear model predicts mps with slightly higher accuracy (97%) but uses a very large parameter space for DNA sequences. Conversely, the ξM-based RNA-QSAR encodes more structural information and needs only two variables. _____________________________________________________________________________
منابع مشابه
Relation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
متن کاملPhylogenetic Analysis of Beta-Glucanase Producing Actinomycetes Strain TBG-CH22 - A Comparison of Conventional and Molecular Morphometric Approach
Actinomycetes are inexhaustible producers of commercially valuable metabolites, are continually screened for beneficial compounds. The taxonomic and phylogenetic study of novel actinomycetes strains are mostly based on conventional methods and primary DNA structure of 16s rRNA. Although 16s rRNA sequence is well accepted in phylogeny studies, its secondary structures have not been widely used. ...
متن کاملPhylogenetic Analysis of Beta-Glucanase Producing Actinomycetes Strain TBG-CH22 - A Comparison of Conventional and Molecular Morphometric Approach
Actinomycetes are inexhaustible producers of commercially valuable metabolites, are continually screened for beneficial compounds. The taxonomic and phylogenetic study of novel actinomycetes strains are mostly based on conventional methods and primary DNA structure of 16s rRNA. Although 16s rRNA sequence is well accepted in phylogeny studies, its secondary structures have not been widely used. ...
متن کاملMycobacterial transcriptional signals: requirements for recognition by RNA polymerase and optimal transcriptional activity
Majority of the promoter elements of mycobacteria do not function well in other eubacterial systems and analysis of their sequences has established the presence of only single conserved sequence located at the -10 position. Additional sequences for the appropriate functioning of these promoters have been proposed but not characterized, probably due to the absence of sufficient number of strong ...
متن کاملRNA Structures as Mediators of Neurological Diseases and as Drug Targets
RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions a...
متن کامل